Environmental statistics and the trade-off between model-based and TD learning in humans
Authors
Abstract
There is much evidence that humans and other animals use a combination of model-based and model-free RL methods. Although it has been proposed that these systems may dominate according to their relative statistical efficiency in different circumstances, there is little specific evidence, especially in humans, as to the details of this trade-off. Accordingly, we examine the relative performance of different RL approaches in situations where the statistics of reward are differentially noisy and volatile. Using theory and simulation, we show that model-free TD learning is relatively most disadvantaged in cases of high volatility and low noise. We present data from a decision-making experiment manipulating these parameters, showing that humans shift learning strategies in accordance with these predictions. The statistical circumstances favoring model-based RL are also those that promote a high learning rate, which helps explain why, in psychology, the distinction between these strategies is traditionally conceived in terms of rule-based vs. incremental learning.
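The link between reward statistics and learning rate can be illustrated with a minimal sketch. It is not the authors' model or simulation: it assumes, purely for illustration, that the true reward follows a Gaussian random walk (step variance standing in for volatility) and is observed with Gaussian noise, in which case the ideal delta-rule learning rate is the steady-state Kalman gain. The function name and the parameter values are hypothetical.

```python
import math


def steady_state_learning_rate(volatility_var: float, noise_var: float) -> float:
    """Steady-state Kalman gain for a random-walk reward observed with noise.

    Assumed model (an illustration, not taken from the paper):
        true reward:  x_t = x_{t-1} + w_t,  w_t ~ N(0, volatility_var)
        observation:  r_t = x_t + v_t,      v_t ~ N(0, noise_var)
    The gain plays the role of the learning rate in the delta rule
        estimate += gain * (r_t - estimate)
    """
    q, s = volatility_var, noise_var
    # The steady-state prior variance P solves P^2 - q*P - q*s = 0.
    prior_var = (q + math.sqrt(q * q + 4.0 * q * s)) / 2.0
    return prior_var / (prior_var + s)


if __name__ == "__main__":
    # Hypothetical parameter settings chosen only to contrast the regimes.
    regimes = [
        ("high volatility, low noise", 1.0, 0.01),
        ("equal volatility and noise", 1.0, 1.0),
        ("low volatility, high noise", 0.01, 1.0),
    ]
    for label, q, s in regimes:
        rate = steady_state_learning_rate(q, s)
        print(f"{label}: learning rate = {rate:.3f}")
```

Under these assumptions the gain approaches 1 when volatility dominates noise and approaches 0 in the opposite regime, which is consistent with the abstract's statement that high volatility and low noise promote a high learning rate.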
Similar resources
Relationship between Environmental Quality and Economic Growth in Developing Countries (based on Environmental Performance Index)
In order to evaluate the development levels of countries, economic growth along with environmental quality account for important indices nowadays. The impacts of environmental quality (based on environmental performance index), the direct foreign investment, and trade openness on economic growth in selected developing countries have been scrutinized in the present study. In the present study th...
DRIVING OPTIMUM TRADE-OFF BETWEEN THE BENEFITS AND COSTS OF INTERBASIN WATER TRANSFER PROJECTS
Interbasin water transfer is a remedy to mitigate the negative effects of water shortage in arid and semi-arid regions. In a water transfer project the receiving basin always benefits, while the sending basin may suffer. In this study, the project of interbasin water transfer from the Dez water resources system in south-west Iran to the central part of the country is investigated during...
Economic Competitiveness and Environmental Policy: An Application of the Heckscher-Ohlin-Vanek (HOV) Model
The relationship between trade liberalisation and the environment has been the subject of a growing body of literature in recent years. As can be seen from the differing assessments of instrument types for environmental protection, regulatory stringency and efficiency are among the important factors in the relationship between environmental protection and economic competitiveness. This concern...
Prediction of the vegetation management impacts on reduction of wind erosion risk in the southern parts of the Varamin Plain, Iran
Wind erosion is a major environmental issue affecting land resources and socio-economic settings in Iran. This paper outlines a study undertaken to provide a new tool to manage wind erosion from physical and economic perspectives. The southern part of the Varamin Plain in south of Tehran is used as a case study. The focus of this study is on exploring the economic and physical impacts of 16 veg...
A Multi-Mode Resource-Constrained Optimization of Time-Cost Trade-off Problems in Project Scheduling Using a Genetic Algorithm
In this paper, we present a genetic algorithm (GA) for optimization of a multi-mode resource-constrained time-cost trade-off (MRCTCT) problem. In the proposed GA, each activity has several operational modes and each mode identifies a possible executive time and cost of the activity. Beyond earlier studies on the time-cost trade-off problem, in the MRCTCT problem, resource requirements of each execution mo...